sign agreement
Decom-Renorm-Merge: Model Merging on the Right Space Improves Multitasking
Chaichana, Yuatyong, Trachu, Thanapat, Limkonchotiwat, Peerat, Preechakul, Konpat, Khandhawit, Tirasan, Chuangsuwanich, Ekapol
In the era of large-scale training, model merging has evolved into a tool for creating multitasking models efficiently. It enables the knowledge of models to be fused, without the need for heavy computation as required in traditional multitask learning. Existing merging methods often assume that entries at identical positions in weight matrices serve the same function, enabling straightforward entry-wise comparison and merging. However, this assumption overlooks the complexity of finetuned neural networks, where neurons may develop distinct feature compositions, making direct entry-wise merging problematic. We present Decom-Renorm-Merge (DRM), a simple yet effective approach that leverages Singular Value Decomposition to decompose and coordinate weight matrices into an aligned joint space, where entry-wise merging becomes possible. We showcase the effectiveness of DRM across various settings ranging from smaller encoder-based such as ViT and DeBERTa, encoder-decoder-based such as T5, and larger decoder-based such as Llama3.1-8B. Our experimental results show that DRM outperforms several state-of-the-art merging techniques across full finetuning and low-rank adaptation settings. Moreover, our analysis reveals renormalization as the crucial component for creating a robust and even joint space for merging, significantly contributing to the method's performance.
- South America > Chile > Santiago Metropolitan Region > Santiago Province > Santiago (0.04)
- North America > United States > Washington > King County > Seattle (0.04)
- North America > United States > Louisiana > Orleans Parish > New Orleans (0.04)
- (6 more...)
- Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)
- Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.92)
- Information Technology > Artificial Intelligence > Natural Language > Large Language Model (0.89)
Stochastic Amortization: A Unified Approach to Accelerate Feature and Data Attribution
Covert, Ian, Kim, Chanwoo, Lee, Su-In, Zou, James, Hashimoto, Tatsunori
Many tasks in explainable machine learning, such as data valuation and feature attribution, perform expensive computation for each data point and can be intractable for large datasets. These methods require efficient approximations, and learning a network that directly predicts the desired output, which is commonly known as amortization, is a promising solution. However, training such models with exact labels is often intractable; we therefore explore training with noisy labels and find that this is inexpensive and surprisingly effective. Through theoretical analysis of the label noise and experiments with various models and datasets, we show that this approach significantly accelerates several feature attribution and data valuation methods, often yielding an order of magnitude speedup over existing approaches.
- Asia > Middle East > Jordan (0.04)
- Europe > Italy > Marche > Ancona Province > Ancona (0.04)
- North America > United States > California > Santa Clara County > Palo Alto (0.04)
- Europe > Denmark (0.04)
India and Germany likely to sign agreement on artificial intelligence
NEW DELHI: Germany and India are likely to sign agreements including a partnership on the use of artificial intelligence in farming during a three-day visit to New Delhi by Chancellor Angela Merkel that begins on Thursday, the German ambassador said. Merkel will be accompanied by several cabinet colleagues and a business delegation, ambassador Walter J. Lindner told reporters. Merkel and Indian Prime Minister Narendra Modi are expected to discuss trade, investment, regional security and climate change. Both countries could sign agreements in areas such as artificial intelligence and green urban mobility, Lindner said. "This time, the focus will be on economic and trade relations, innovation and digitalisation, and climate protection and sustainable development," Merkel said in a message ahead of the visit released by the Indian embassy in Berlin.